NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Spectral Guarantees for Adversarial Streaming PCA

Price, Eric; Xun, Zhiyang (October 2024, FOCS 2024)

Full Text Available
Beyond Catoni: Sharper Rates for Heavy-Tailed and Robust Mean Estimation

Gupta, Shivam; Hopkins, Samuel B; Price, Eric (June 2024, Conference on Learning Theory. The 37th Annual Conference on Learning Theory (COLT 2024))

We study the fundamental problem of estimating the mean of a d-dimensional distribution with covariance Σ≼σ2Id given n samples. When d=1, \cite{catoni} showed an estimator with error (1+o(1))⋅σ2log1δn−−−−−√, with probability 1−δ, matching the Gaussian error rate. For d>1, a natural estimator outputs the center of the minimum enclosing ball of one-dimensional confidence intervals to achieve a 1−δ confidence radius of 2dd+1−−−√⋅σ(dn−−√+2log1δn−−−−−√), incurring a 2dd+1−−−√-factor loss over the Gaussian rate. When the dn−−√ term dominates by a log1δ−−−−√ factor, \cite{lee2022optimal-highdim} showed an improved estimator matching the Gaussian rate. This raises a natural question: Is the 2dd+1−−−√ loss \emph{necessary} when the 2log1δn−−−−−√ term dominates? We show that the answer is \emph{no} -- we construct an estimator that improves over the above naive estimator by a constant factor. We also consider robust estimation, where an adversary is allowed to corrupt an ϵ-fraction of samples arbitrarily: in this case, we show that the above strategy of combining one-dimensional estimates and incurring the 2dd+1−−−√-factor \emph{is} optimal in the infinite-sample limit.
more » « less
Full Text Available
Beyond Catoni: Sharper Rates for Heavy-Tailed and Robust Mean Estimation

Gupta, Shivam; Hopkins, Samuel B; Price, Eric (June 2024, COLT 2024)

Full Text Available
Diffusion Posterior Sampling is Computationally Intractable

Gupta, Shivam; Jalal, Ajil; Parulekar, Aditya; Price, Eric; Xun, Zhiyang (July 2024, ICML 2024)

Full Text Available
Sharp Noisy Binary Search with Monotonic Probabilities

https://doi.org/10.4230/LIPIcs.ICALP.2024.75

Gretta, Lucas; Price, Eric (January 2024, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Bringmann, Karl; Grohe, Martin; Puppis, Gabriele; Svensson, Ola (Ed.)
We revisit the noisy binary search model of [Karp and Kleinberg, 2007], in which we have n coins with unknown probabilities p_i that we can flip. The coins are sorted by increasing p_i, and we would like to find where the probability crosses (to within ε) of a target value τ. This generalized the fixed-noise model of [Burnashev and Zigangirov, 1974], in which p_i = 1/2 ± ε, to a setting where coins near the target may be indistinguishable from it. It was shown in [Karp and Kleinberg, 2007] that Θ(1/ε² log n) samples are necessary and sufficient for this task. We produce a practical algorithm by solving two theoretical challenges: high-probability behavior and sharp constants. We give an algorithm that succeeds with probability 1-δ from 1/C_{τ, ε} ⋅ (log₂ n + O(log^{2/3} n log^{1/3} 1/(δ) + log 1/(δ))) samples, where C_{τ, ε} is the optimal such constant achievable. For δ > n^{-o(1)} this is within 1 + o(1) of optimal, and for δ ≪ 1 it is the first bound within constant factors of optimal.
more » « less
Full Text Available
Beyond Catoni: Sharper Rates for Heavy-Tailed and Robust Mean Estimation

Gupta, Shivam; Hopkins, Samuel B; Price, Eric (February 2024, https://arxiv.org/abs/2311.13010)

Full Text Available
A Competitive Algorithm for Agnostic Active Learning

Price, Eric; Zhou, Yihan (December 2023, NeurIPS 2023)

Full Text Available
Minimax-Optimal Location Estimation

Gupta, Shivam; Lee, Jasper CH; Price, Eric; Valiant, Paul (December 2023, NeurIPS 2023)

Location estimation is one of the most basic questions in parametric statistics. Suppose we have a known distribution density f, and we get n i.i.d. samples from f(x − µ) for some unknown shift µ. The task is to estimate µ to high accuracy with high probability. The maximum likelihood estimator (MLE) is known to be asymptotically optimal as n → ∞, but what is possible for finite n? In this paper, we give two location estimators that are optimal under different criteria: 1) an estimator that has minimax-optimal estimation error subject to succeeding with probability 1 − δ and 2) a confidence interval estimator which, subject to its output interval containing µ with probability at least 1 − δ, has the minimum expected squared interval width among all shift-invariant estimators. The latter construction can be generalized to minimizing the expectation of any loss function on the interval width.
more » « less
Full Text Available
Learning a 1-layer conditional generative model in total variation

Jalal, Ajil; Kang, Justin; Uppal, Ananya; Ramchandran, Kannan; Price, Eric (December 2023, Neurips 2023)

Full Text Available
An Improved Online Reduction from PAC Learning to Mistake-Bounded Learning

https://doi.org/10.1137/1.9781611977585.ch34

Gretta, Lucas; Price, Eric (January 2023, Symposium on Simplicity in Algorithms)

Full Text Available

« Prev Next »

Search for: All records